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Abstract 

The computer data up to 2 w 1.76 x 10 13 on the gaps between consecutive 
twins is presented. The simple derivation of the heuristic formula describing 
computer results contained in the recent papers by P.F.Kelly and T. Pilling 
||, H is provided and compared with the "experimental" values. 
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Among the primes the subset of twin primes is distinguished: twins are such 
numbers (p, p') that both p and p' = p + 2 are prime. So the set of twins starts with 
(3, 5), (5, 7), (11, 13), (17, 19), (29, 31) .... It is not known whether there is infinity 
of twins; the largest known today pair of twins was found recently by Underbakke, 
Carmody, Gallot (see |ittp: / / www.utm.edu/research/primcs/largcst.htmll) and it is 
the following pair of 29603 digits numbers: 

1807318575 x 2 98305 ± 1. (1) 

The mathematicians are using the notation ^(iV) to denote the number of twins 
smaller than N and the Hardy and Littlewood conjecture B states that the 
number of twins below a given bound N should be approximately equal to 

f N d u N 

where the constant C2 (sometimes called "twin-prime" constant) is defined in the 
following way: 

C2 " 2 S(^(^) =L32032 -- (3) 

The problem of distribution of twins is very difficult. One of the characteristics 
one can use for this purpose is the statistics of gaps between consecutive twins. Let 
d denote the distance between two consecutive twins measured as a arithmetical 
difference between last primes constituting consecutive twins, thus for example for 
twins (29,31) and (17,19) d = 12. The distances can be only multiplicities of 6: 
d = 6k, because all twins are of the form 6k ± 10]. Next let m(d,N) denote the 
number of twins separated by d and smaller than N: 



m(d, N) = number of consecutive twins (p m ,Pm+i = Pm + 2) and (p n ,p n+ i = p n + 2) 

such that p n — p m+ i = d, and p n+ i < N (4) 

A few years ago I have obtained on the computer data for m(d, N) for N = 
2 26 , . . . iV = 2 44 [[J. A part of results is shown on the Fig.[T|. 

Recently there appeared two papers by P.F.Kelly and T. Pilling ||, ||. They 
have looked on the histogram of separations between consecutive twins measured 
by the number of primes in between, not just the arithmetical difference as above. 
Here I am providing heuristic formulas which describe findings of Kelly and Pilling 
as well as much larger than in and || computer data which corroborates the 
obtained analytical relations. 
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Figure 1: The plot showing the dependence of histogram of gaps between consecutive 
twins at N = 2 26 , 2 32 , 2 38 and 2 44 . There is linear scale on x axis and logarithmical 
scale on y scale. 
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Let fi(s, N) denote the number of consecutive twins smaller than iV and separated 
by s primes, with exception of primes constituting twins in question, i.e. for example 
for (5, 7) and (11, 13) s = while for (17, 19) and (29, 31) 8 = 1 (prime 23 lies in 
between). More strictly we adopt the following definition: 

fj,(s, N) = number of consecutive twins (p m ,p m +i = p m +2) and (p n ,p n+ i = p n +2) 

such that 7r(p n ) — ^(p-m+i) — 1 = s, and p n +i < N (5) 
Here n(N) is as usual the number of primes< N and 

f N du N 

The authors of || have found that /i(s, N) for a given N decreases exponentially 
with s. In fact these authors are working with relative frequencies //(s, N) / J2t N) 
but here I will use absolute values /i(s, N). 

I have made the computer search up to = 2 44 w 1.76 x 10 13 and counted the 
number of primes between consecutive twins. During the computer search the data 
representing the function /i(s, N) were stored at values of N forming the geometrical 
progression with the ratio 4, i.e. at 2 22 , . . . , 2 42 , 2 44 . The resulting curves are plotted 
in the Fig.|. 

Because the points lie on the straight lines on the semi-logarithmic scale, we can 
infer from the Fig.|2| the ansatz 

n(s,N) ~ A(N)e- BWs . (7) 

The functions A(N) and B(N), giving the intercepts and the slopes of straight 
lines seen in the Fig.|], can be determined by exploiting two identities that /jl(s, N) 
have to obey. First of all, the sum of u(s, N) over all s is the total number of twins 
< N: 

E ti*,*0 = *2{N). (8) 

Here s max (N) is the largest separation s between twins < N, see later discussion. 
The second selfconsistency condition comes from the observation, that 

su(s,N) = 7r(N)-2n 2 (N). (9) 

In fact the above sum starts with s = 1 what is important for the relations obtained 
below. Putting the ansatz (0) into (|§|) and ([]) and collecting appropriately terms we 
end up with the geometrical series with quotient e~ B ^ and (0) is a differentiated 
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Figure 2: The plot showing the dependence of the histogram (j,(s, N) on s at iV = 
2 28 , 2 30 , . . . , 2 44 . There is a logarithmical scale on the y-axis , while on the x-axis 
there is a linear scale. 
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geometrical series. Using the approximation and summing in (|8|) and (0) up to 
infinity instead of particular s max (which depends on N) I end up with the equations: 

A(N) ~MN) (10) 



1 -exp{-B(N)} 
and 



' [ ' 1 n(N)-2ir 2 (N). (11) 



;i - exp{-B(JV)}) s 



From the Fig.[|it is seen that the slopes B(N) are smaller than 1 and they decrease 
with N, e.g. B(2 42 ) = 0.0503..., B(2 U ) = 0.0483.... Thus in the above relations I 
can put exp{— B(N)} « 1 — B(N), B(N) <C 1 and solve the equations. Finally 
the main conjecture is: 

A(N) « - B(N) « ? ^ , (12) 

Putting here asymptotic forms of ir(N) and 7r 2 (iV) I make the following guess: 

r 2 N 2 r 

The comparison of these formulae is given in Table 1, where the values A t f ieor {N) 
and B theor (N) obtained from (|IUD and (fTT[) are divided by values A exp (N) and 
B exp (N) obtained by least-square method from data presented on Fig.|[ The actual 
numbers A exp and B exp depend on the number of points taken for linear regression 
method, results in Table 1 are obtained when I skipped 15 first points and 40% of 
last point, where large fluctuation appear (in fact only 2 points determine straight 
line). For the calculation of A t h eor [N) and B t heor{N) I have used exact values of 
7t(N) and 7r 2 (iV) at iV = 2 22 , . . . N = 2 44 from my earlier computer run 

All ratios tend to 1 with increasing N, as it should be and asymptotic values 
(|T3D are smaller than exact values, since in (0) and @ dots denote positive terms 
which were skipped. 

Another characteristic which can be used to test the conjecture fll2"|) is the ques- 
tion what are the largest separations s max (N) between twins up to a given N. It 
corresponds to problem of the maximal gaps between consecutive primes, what has 
a long history, see 0. Heuristic argument is that the maximal gap appears only 
once and hence it can be obtained from the equation: 

fi(s max (N),N) = l (14) 
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TABLE I 



N 


A exp (N)/A theor (N) 


B exp {N)/B theor {N) 


A eX p (N) /A aS y mp t (iV) 


Bexp(N) 1 B asyrnpt (N) 


2 22 


0.964932 


0.971223 


1.528465 


1.308701 


2 24 


0.983914 


0.986198 


1.475196 


1.283024 


2 26 


0.967421 


0.980584 


1.389283 


1.241010 


2 28 


0.972147 


0.982905 


1.353625 


1.219132 


2 30 


0.965933 


0.980277 


1.312914 


1.196643 


2 32 


0.970505 


0.983126 


1.291164 


1.183385 


2 u 


0.973114 


0.984629 


1.270698 


1.170806 


2 36 


0.975160 


0.985884 


1.252173 


1.159566 


2 38 


0.976340 


0.986722 


1.235622 


1.149610 


2 40 


0.977286 


0.987373 


1.220839 


1.140667 


2 42 


0.977457 


0.987681 


1.206974 


1.132435 


2 44 


0.978554 


0.988279 


1.195782 


1.125445 



From the main conjecture (|i~2"|) it follows that for large N: 

s max (N) » |M(2hi7r 2 (iV) -ln(7r(iV) - 2c 2 vr 2 (iV))) (15) 
For large iV it goes into the 

w(A0~-hi 2 (iV) (16) 
c 2 

what differs from the Cramer's conjecture for maximal gaps between consecutive 
primes by the factor l/c 2 . It differs also by on power of ln(iV) from the asymptotic 
behavior of maximal gaps between consecutive twins G 2 (iV) ~ ln 3 (iV) obtained just 
arithmetically: i.e. distance between twins p m ,p m+ i = p m + 2 and p n ,Pn+i = Pn + 2 
is simply p n — p m and not vr(p n ) — n(p m+1 ) — 1. The comparison of actual values of 
s m ax{N) obtained from the computer search are compared with formula ([15]) in the 
Fig.3. 

Finally I will make a remark on the problem of champions, i.e. the most often 
occurring gaps. For prime numbers it was treated in 0. In || the authors have 
made the assertion that the champions for twins with gaps measured in terms of 
the primes lying in between is always s = 0. From the Fig.l it is seen that when 
gaps between twins are arithmetical differences then there will be a set of champions 
consisting of d = 30, next emerging peak is d = 210 and so on, as it can be seen on 
the data presented in Fig.l. It will be discussed in more detail in the forthcoming 
paper. 

Finally let us notice that from the Fig. 2 it follows that the separations between 
twins measured by the number of primes in between follow exactly the Poissonian 
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behavior, see e.g. |Ttl. Interestingly the change in the "measuring sticks" removes 



oscillations from Fig.l and leaves pure exponential decrease. 
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The plot showing the dependence of s max (N) and comparison with formula 



